SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Wassenegger, Michael 
Riedel, Leonhard 
Schiebel, Winfried 
Sanger, Heinz 

(ii) TITLE OF INVENTION: NUCLEIC ACID MOLECULES ENCODING 
POLYPEPTIDES HAVING THE ENZYMATIC ACTIVITY OF AN 
RNA-DIRECTED RNA POLYMERASE (RdRP) 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: FISH & NEAVE 

(B) STREET: 1251 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10020 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/811,583 

(B) FILING DATE: 05-MAR-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Haley, James F. 

(B) REGISTRATION NUMBER: 27,794 

(C) REFERENCE/DOCKET NUMBER: MPG-1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212-596-9000 

(B) TELEFAX: 212-596-9090 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3731 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tomato 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 194..3535 
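The CDS location above can be cross-checked against the stated length of SEQ ID NO: 2 (1114 amino acids). The following is a minimal sketch of that arithmetic; the function name `cds_codon_count` is hypothetical and introduced only for illustration, and the range is assumed to be 1-based and inclusive, as is usual for feature locations.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of complete codons in a 1-based, inclusive CDS range.

    Hypothetical helper for illustration; it is not part of the listing.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


# CDS location taken from the FEATURE entry of SEQ ID NO: 1.
codons = cds_codon_count(194, 3535)
print(codons)  # 1114, the length given for SEQ ID NO: 2
```

The range covers 3342 nucleotides, that is, 1114 codons, which agrees with the 1114 amino acids listed for SEQ ID NO: 2; the TGA that follows position 3535 lies outside the stated range.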



Express Mail Label 
# Ell 874475 17US 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GAAATATTCT TTACTTACTT CACCAGGGAT TGACTCATCA CTCCCCTCAA GTCTTTGTGT 60 

GTTGTGATAA TAAATTTGGT TGTGCTTCAG TTTCAGTCAC TACTGCTGGG TAGTTTTTAT 120 

TTTGCATAAC TTCAGGGGGT ATTCCAGTTG GTGTTAGCAT TTGAAAGTCG AACTGCACTT 180 

GGAATTTGGC TAG ATG GGA AAG ACA ATT CAG GTT TTC GGA TTC CCT TAT 229 
Met Gly Lys Thr Ile Gln Val Phe Gly Phe Pro Tyr 
1 5 10 

CTT CTC TCT GCG GAA GTG GTT AAG TCA TTC TTA GAG AAA TAT ACA GGA 277 
Leu Leu Ser Ala Glu Val Val Lys Ser Phe Leu Glu Lys Tyr Thr Gly 
15 20 25 

TAT GGA ACT GTA TGT GCA TTG GAG GTT AAA CAG TCC AAA GGA GGA TCT 325 
Tyr Gly Thr Val Cys Ala Leu Glu Val Lys Gln Ser Lys Gly Gly Ser 
30 35 40 

AGA GCA TTT GCC AAA GTT CAA TTT GCC GAC AAC ATA AGT GCT GAC AAA 373 
Arg Ala Phe Ala Lys Val Gln Phe Ala Asp Asn Ile Ser Ala Asp Lys 
45 50 55 60 

ATC ATC ACT TTG GCT AAT AAC AGG CTG TAT TTT GGC TCT TCT TAT TTG 421 
Ile Ile Thr Leu Ala Asn Asn Arg Leu Tyr Phe Gly Ser Ser Tyr Leu 
65 70 75 

AAG GCT TGG GAA ATG AAA ACT GAT ATT GTC CAA CTG CGG GCA TAT GTG 469 
Lys Ala Trp Glu Met Lys Thr Asp Ile Val Gln Leu Arg Ala Tyr Val 
80 85 90 

GAT CAG ATG GAT GGC ATA ACT TTG AAT TTC GGA TGT CAG ATA TCA GAT 517 
Asp Gln Met Asp Gly Ile Thr Leu Asn Phe Gly Cys Gln Ile Ser Asp 
95 100 105 



GAC AAG TTT GCA GTG TTG GGA AGT ACA GAA GTT TCA ATT CAA TTT GGC 565 
Asp Lys Phe Ala Val Leu Gly Ser Thr Glu Val Ser Ile Gln Phe Gly 
110 115 120 

ATT GGA TTG AAG AAA TTT TTT TTC TTT TTA TCT AGT GGT TCA GCT GAC 613 
Ile Gly Leu Lys Lys Phe Phe Phe Phe Leu Ser Ser Gly Ser Ala Asp 
125 130 135 140 

TAT AAA CTT CAG CTT TCA TAT GAA AAT ATA TGG CAG GTT GTG CTC CAT 661 
Tyr Lys Leu Gln Leu Ser Tyr Glu Asn Ile Trp Gln Val Val Leu His 
145 150 155 

CGT CCA TAT GGT CAA AAT GCT CAG TTT CTC CTC ATA CAG TTA TTT GGT 709 
Arg Pro Tyr Gly Gln Asn Ala Gln Phe Leu Leu Ile Gln Leu Phe Gly 
160 165 170 

GCT CCT CGG ATC TAT AAG AGA CTT GAA AAC TCC TGT TAT AGC TTC TTT 757 
Ala Pro Arg Ile Tyr Lys Arg Leu Glu Asn Ser Cys Tyr Ser Phe Phe 
175 180 185 

AAG GAA ACT CCT GAT GAT CAG TGG GTG AGG ACA ACA GAT TTC CCT CCA 805 
Lys Glu Thr Pro Asp Asp Gln Trp Val Arg Thr Thr Asp Phe Pro Pro 
190 195 200 



TCT TGG ATA GGG CTA TCT TCT AGC TTA TGT TTG CAG TTC CGT AGG GGT 853 
Ser Trp Ile Gly Leu Ser Ser Ser Leu Cys Leu Gln Phe Arg Arg Gly 
205 210 215 220 

GTT CGT CTT CCA AAT TTC GAG GAA AGT TTT TTC CAC TAT GCA GAA CGT 901 
Val Arg Leu Pro Asn Phe Glu Glu Ser Phe Phe His Tyr Ala Glu Arg 
225 230 235 

GAA AAC AAT ATT ACT TTA CAG ACT GGT TTC ACC TTT TTC GTC TCT CAA 949 
Glu Asn Asn Ile Thr Leu Gln Thr Gly Phe Thr Phe Phe Val Ser Gln 
240 245 250 

AAA TCG GCT CTG GTT CCC AAT GTC CAG CCT CCG GAA GGA ATT TCA ATT 997 
Lys Ser Ala Leu Val Pro Asn Val Gln Pro Pro Glu Gly Ile Ser Ile 
255 260 265 

CCC TAC AAG ATT TTG TTC AAA ATT AGT TCT TTG GTA CAG CAT GGA TGC 1045 
Pro Tyr Lys Ile Leu Phe Lys Ile Ser Ser Leu Val Gln His Gly Cys 
270 275 280 

ATA CCT GGG CCA GCA TTA AAT GTC TAC TTT TTC CGA TTA GTT GAT CCT 1093 
Ile Pro Gly Pro Ala Leu Asn Val Tyr Phe Phe Arg Leu Val Asp Pro 
285 290 295 300 

CGA AGG AGA AAT GTG GCA TGC ATT GAG CAT GCC TTA GAG AAA CTG TAC 1141 
Arg Arg Arg Asn Val Ala Cys Ile Glu His Ala Leu Glu Lys Leu Tyr 
305 310 315 

TAT ATA AAG GAG TGC TGT TAT GAT CCC GTG AGG TGG CTC ACT GAG CAG 1189 
Tyr Ile Lys Glu Cys Cys Tyr Asp Pro Val Arg Trp Leu Thr Glu Gln 
320 325 330 

TAT GAT GGG TAT CTC AAG GGT AGA CAA CCT CCA AAA TCT CCG TCC ATC 1237 
Tyr Asp Gly Tyr Leu Lys Gly Arg Gln Pro Pro Lys Ser Pro Ser Ile 
335 340 345 

ACT TTA GAT GAT GGG TTG GTG TAT GTA AGA AGG GTC CTA GTA ACA CCA 1285 

Thr Leu Asp Asp Gly Leu Val Tyr Val Arg Arg Val Leu Val Thr Pro 
350 355 360 

TGC AAA GTT TAT TTT TGT GGT CCA GAG GTT AAT GTT TCC AAT CGG GTT 1333 

Cys Lys Val Tyr Phe Cys Gly Pro Glu Val Asn Val Ser Asn Arg Val 
365 370 375 380 

CTC CGC AAT TAT TCT GAA GAC ATA GAT AAC TTT CTT CGT GTT TCT TTT 1381 
Leu Arg Asn Tyr Ser Glu Asp Ile Asp Asn Phe Leu Arg Val Ser Phe 
385 390 395 

GTT GAT GAG GAG TGG GAG AAA CTG TAT TCT ACA GAC TTA TTA CCA AAA 1429 

Val Asp Glu Glu Trp Glu Lys Leu Tyr Ser Thr Asp Leu Leu Pro Lys 

400 405 410 

GCA AGT ACT GGA AGT GGT GTC AGG ACA AAC ATC TAT GAG AGG ATC TTA 1477 
Ala Ser Thr Gly Ser Gly Val Arg Thr Asn Ile Tyr Glu Arg Ile Leu 
415 420 425 

TCA ACT CTG CGG AAA GGC TTT GTA ATT GGT GAT AAA AAA TTT GAA TTT 1525 
Ser Thr Leu Arg Lys Gly Phe Val Ile Gly Asp Lys Lys Phe Glu Phe 
430 435 440 



CTT GCA TTT TCA TCG AGC CAG TTG CGG GAT AAT TCA GTG TGG ATG TTT 1573 
Leu Ala Phe Ser Ser Ser Gln Leu Arg Asp Asn Ser Val Trp Met Phe 
445 450 455 460 

GCA TCA AGA CCT GGC CTT ACT GCA AAT GAT ATA AGA GCT TGG ATG GGT 1621 
Ala Ser Arg Pro Gly Leu Thr Ala Asn Asp Ile Arg Ala Trp Met Gly 
465 470 475 

GAT TTT TCG CAG ATC AAG AAT GTC GCA AAA TAT GCT GCC AGA CTT GGT 1669 
Asp Phe Ser Gln Ile Lys Asn Val Ala Lys Tyr Ala Ala Arg Leu Gly 
480 485 490 

CAA TCT TTT GGT TCC TCC AGA GAG ACT TTG AGT GTT CTT AGG CAT GAG 1717 
Gln Ser Phe Gly Ser Ser Arg Glu Thr Leu Ser Val Leu Arg His Glu 
495 500 505 

ATT GAA GTT ATT CCC GAT GTA AAG GTT CAT GGA ACC AGC TAT GTC TTT 1765 
Ile Glu Val Ile Pro Asp Val Lys Val His Gly Thr Ser Tyr Val Phe 
510 515 520 

TCT GAT GGA ATT GGT AAA ATA TCT GGT GAC TTT GCT CAT AGA GTT GCC 1813 
Ser Asp Gly Ile Gly Lys Ile Ser Gly Asp Phe Ala His Arg Val Ala 
525 530 535 540 

TCA AAA TGT GGC CTT CAA TAT ACC CCA TCT GCT TTC CAG ATT CGT TAT 1861 
Ser Lys Cys Gly Leu Gln Tyr Thr Pro Ser Ala Phe Gln Ile Arg Tyr 
545 550 555 

GGT GGA TAT AAA GGT GTT GTG GGT GTT GAT CCG GAT TCA TCA ATG AAG 1909 
Gly Gly Tyr Lys Gly Val Val Gly Val Asp Pro Asp Ser Ser Met Lys 
560 565 570 

TTG TCT TTG AGA AAG AGC ATG TCG AAA TAT GAA TCA GAC AAC ATA AAG 1957 
Leu Ser Leu Arg Lys Ser Met Ser Lys Tyr Glu Ser Asp Asn Ile Lys 
575 580 585 

TTA GAT GTC CTT GGA TGG AGC AAA TAT CAG CCT TGT TAT CTT AAT CGT 2005 
Leu Asp Val Leu Gly Trp Ser Lys Tyr Gln Pro Cys Tyr Leu Asn Arg 
590 595 600 

CAA CTG ATT ACG CTC TTG TCT ACA CTT GGA GTG AAA GAT GAA GTT CTC 2053 
Gln Leu Ile Thr Leu Leu Ser Thr Leu Gly Val Lys Asp Glu Val Leu 
605 610 615 620 

GAA CAG AAG CAA AAG GAA GCT GTA GAT CAG CTT GAT GCT ATC TTG CAT 2101 
Glu Gln Lys Gln Lys Glu Ala Val Asp Gln Leu Asp Ala Ile Leu His 
625 630 635 

GAT TCT TTG AAG GCA CAG GAG GCT TTG GAA TTG ATG TCT CCT GGA GAG 2149 
Asp Ser Leu Lys Ala Gln Glu Ala Leu Glu Leu Met Ser Pro Gly Glu 
640 645 650 

AAC ACT AAT ATT CTC AAG GCA ATG CTA AAC TGT GGT TAT AAG CCT GAT 2197 
Asn Thr Asn Ile Leu Lys Ala Met Leu Asn Cys Gly Tyr Lys Pro Asp 
655 660 665 

GCT GAG CCC TTT CTT TCA ATG ATG TTG CAA ACC TTC CGC GCA TCC AAG 2245 
Ala Glu Pro Phe Leu Ser Met Met Leu Gln Thr Phe Arg Ala Ser Lys 
670 675 680 



TTG CTC GAT TTG CGG ACT AGA TCA AGA ATA TTT ATT CCA AAT GGA AGA 2293 
Leu Leu Asp Leu Arg Thr Arg Ser Arg Ile Phe Ile Pro Asn Gly Arg 
685 690 695 700 

ACA ATG ATG GGA TGT TTG GAT GAA TCC AGA ACC TTG GAA TAT GGT CAG 2341 
Thr Met Met Gly Cys Leu Asp Glu Ser Arg Thr Leu Glu Tyr Gly Gln 
705 710 715 

GTG TTT GTT CAG TTT ACT GGT GCT GGA CAT GGA GAG TTT TCT GAC GAT 2389 
Val Phe Val Gln Phe Thr Gly Ala Gly His Gly Glu Phe Ser Asp Asp 
720 725 730 

TTA CAT CCA TTT AAT AAC AGC AGA TCC ACC AAC AGT AAT TTC ATT CTG 2437 
Leu His Pro Phe Asn Asn Ser Arg Ser Thr Asn Ser Asn Phe Ile Leu 
735 740 745 

AAG GGA AAT GTG GTT GTT GCA AAA AAT CCA TGC TTG CAT CCT GGT GAT 2485 
Lys Gly Asn Val Val Val Ala Lys Asn Pro Cys Leu His Pro Gly Asp 
750 755 760 

ATT CGT GTT TTA AAG GCT GTA AAT GTT CGA GCG CTG CAC CAC ATG GTA 2533 
Ile Arg Val Leu Lys Ala Val Asn Val Arg Ala Leu His His Met Val 
765 770 775 780 

GAT TGT GTT GTA TTC CCT CAG AAA GGA AAA AGA CCT CAT CCG AAT GAA 2581 
Asp Cys Val Val Phe Pro Gln Lys Gly Lys Arg Pro His Pro Asn Glu 
785 790 795 

TGT TCT GGG AGT GAT TTG GAT GGG GAT ATC TAC TTT GTT TGC TGG GAT 2629 
Cys Ser Gly Ser Asp Leu Asp Gly Asp Ile Tyr Phe Val Cys Trp Asp 
800 805 810 

CAA GAC ATG ATC CCG CCA AGG CAA GTC CAG CCG ATG GAA TAT CCT CCA 2677 
Gln Asp Met Ile Pro Pro Arg Gln Val Gln Pro Met Glu Tyr Pro Pro 
815 820 825 

GCA CCC AGC ATA CAG TTG GAC CAT GAT GTC ACA ATT GAG GAA GTT GAA 2725 
Ala Pro Ser Ile Gln Leu Asp His Asp Val Thr Ile Glu Glu Val Glu 
830 835 840 

GAG TAC TTC ACC AAC TAT ATT GTG AAT GAC AGT TTG GGA ATC ATA GCA 2773 
Glu Tyr Phe Thr Asn Tyr Ile Val Asn Asp Ser Leu Gly Ile Ile Ala 
845 850 855 860 

AAT GCC CAT GTC GTA TTT GCA GAC AGA GAA CCT GAT ATG GCC ATG AGT 2821 
Asn Ala His Val Val Phe Ala Asp Arg Glu Pro Asp Met Ala Met Ser 
865 870 875 

GAT CCA TGC AAA AAA CTT GCT GAG CTC TTT TCA ATT GCA GTG GAC TTT 2869 
Asp Pro Cys Lys Lys Leu Ala Glu Leu Phe Ser Ile Ala Val Asp Phe 
880 885 890 

CCA AAG ACT GGT GTT CCC GCT GAA ATA CCA TCT CAG TTG CGC CCT AAA 2917 
Pro Lys Thr Gly Val Pro Ala Glu Ile Pro Ser Gln Leu Arg Pro Lys 
895 900 905 

GAA TAC CCA GAC TTC ATG GAT AAG CCG GAC AAG ACC AGC TAT ATC TCA 2965 
Glu Tyr Pro Asp Phe Met Asp Lys Pro Asp Lys Thr Ser Tyr Ile Ser 
910 915 920 



GAA AGA GTT ATT GGA AAG CTT TTC AGG AAA GTG AAG GAC AAA GCA CCT 3013 
Glu Arg Val Ile Gly Lys Leu Phe Arg Lys Val Lys Asp Lys Ala Pro 
925 930 935 940 

CAG GCT AGC TCT ATC GCG ACC TTC ACA AGA GAT GTT GCA AGG AGA TCA 3061 
Gln Ala Ser Ser Ile Ala Thr Phe Thr Arg Asp Val Ala Arg Arg Ser 
945 950 955 

TAT GAT GCT GAT ATG GAA GTT GAT GGA TTT GAA GAT TAC ATT GAC GAA 3109 
Tyr Asp Ala Asp Met Glu Val Asp Gly Phe Glu Asp Tyr Ile Asp Glu 
960 965 970 

GCT TTT GAC TAC AAA ACT GAA TAT GAC AAC AAG CTG GGT AAT TTA ATG 3157 

Ala Phe Asp Tyr Lys Thr Glu Tyr Asp Asn Lys Leu Gly Asn Leu Met 
975 980 985 

GAC TAC TAT GGC ATA AAA ACA GAG GCT GAA ATA CTT AGT GGT GGC ATT 3205 
Asp Tyr Tyr Gly Ile Lys Thr Glu Ala Glu Ile Leu Ser Gly Gly Ile 
990 995 1000 

ATG AAG GCA TCA AAA ACT TTT GAC CGC AGA AAA GAT GCT GAG GCC ATT 3253 
Met Lys Ala Ser Lys Thr Phe Asp Arg Arg Lys Asp Ala Glu Ala Ile 
1005 1010 1015 1020 

AGT GTT GCT GTG AGG GCC TTG AGG AAG GAG GCA AGA GCC TGG TTC AAG 3301 

Ser Val Ala Val Arg Ala Leu Arg Lys Glu Ala Arg Ala Trp Phe Lys 
1025 1030 1035 

AGG CGT AAT GAT ATA GAT GAC ATG TTA CCA AAG GCT TCG GCT TGG TAC 3349 
Arg Arg Asn Asp Ile Asp Asp Met Leu Pro Lys Ala Ser Ala Trp Tyr 
1040 1045 1050 

CAC GTT ACA TAT CAT CCT ACA TAT TGG GGT TGC TAC AAT CAG GGG TTG 3397 
His Val Thr Tyr His Pro Thr Tyr Trp Gly Cys Tyr Asn Gln Gly Leu 
1055 1060 1065 

AAA AGA GCT CAT TTC ATT AGC TTT CCC TGG TGT GTT TAT GAC CAG CTA 3445 
Lys Arg Ala His Phe Ile Ser Phe Pro Trp Cys Val Tyr Asp Gln Leu 
1070 1075 1080 

ATC CAG ATT AAG AAG GAC AAA GCA CGT AAC AGG CCA GTT CTC AAC TTG 3493 
Ile Gln Ile Lys Lys Asp Lys Ala Arg Asn Arg Pro Val Leu Asn Leu 
1085 1090 1095 1100 

TCA TCT CTC AGG GCT CAA CTG AGT CAC AGA TTA GTG TTG AAA 3535 
Ser Ser Leu Arg Ala Gln Leu Ser His Arg Leu Val Leu Lys 
1105 1110 

TGAGATTCCA GTCGAGCGTT AAGCTGATAT ATATATAATG TAATAGGGTG TGATCATAAG 3595 

AAAACTGTTA TGCATTGTTG ACTACCTTTT GTCTTTAAAA CTGCATGAAG CTGCAACATA 3655 

TATGCAGTAC TCTAAGAAAC AGATGTACAG CTAAGTACTA ATATGTATGT GATTTGAGTT 3715 

TCATCTTTCT TCTAAA 3731 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1114 amino acids 






(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Lys Thr Ile Gln Val Phe Gly Phe Pro Tyr Leu Leu Ser Ala 
1 5 10 15 

Glu Val Val Lys Ser Phe Leu Glu Lys Tyr Thr Gly Tyr Gly Thr Val 
20 25 30 

Cys Ala Leu Glu Val Lys Gln Ser Lys Gly Gly Ser Arg Ala Phe Ala 
35 40 45 

Lys Val Gln Phe Ala Asp Asn Ile Ser Ala Asp Lys Ile Ile Thr Leu 
50 55 60 

Ala Asn Asn Arg Leu Tyr Phe Gly Ser Ser Tyr Leu Lys Ala Trp Glu 
65 70 75 80 

Met Lys Thr Asp Ile Val Gln Leu Arg Ala Tyr Val Asp Gln Met Asp 
85 90 95 

Gly Ile Thr Leu Asn Phe Gly Cys Gln Ile Ser Asp Asp Lys Phe Ala 
100 105 110 

Val Leu Gly Ser Thr Glu Val Ser Ile Gln Phe Gly Ile Gly Leu Lys 
115 120 125 

Lys Phe Phe Phe Phe Leu Ser Ser Gly Ser Ala Asp Tyr Lys Leu Gln 
130 135 140 

Leu Ser Tyr Glu Asn Ile Trp Gln Val Val Leu His Arg Pro Tyr Gly 
145 150 155 160 

Gln Asn Ala Gln Phe Leu Leu Ile Gln Leu Phe Gly Ala Pro Arg Ile 
165 170 175 

Tyr Lys Arg Leu Glu Asn Ser Cys Tyr Ser Phe Phe Lys Glu Thr Pro 
180 185 190 

Asp Asp Gln Trp Val Arg Thr Thr Asp Phe Pro Pro Ser Trp Ile Gly 
195 200 205 

Leu Ser Ser Ser Leu Cys Leu Gln Phe Arg Arg Gly Val Arg Leu Pro 
210 215 220 

Asn Phe Glu Glu Ser Phe Phe His Tyr Ala Glu Arg Glu Asn Asn Ile 
225 230 235 240 

Thr Leu Gln Thr Gly Phe Thr Phe Phe Val Ser Gln Lys Ser Ala Leu 
245 250 255 

Val Pro Asn Val Gln Pro Pro Glu Gly Ile Ser Ile Pro Tyr Lys Ile 
260 265 270 

Leu Phe Lys Ile Ser Ser Leu Val Gln His Gly Cys Ile Pro Gly Pro 
275 280 285 






Ala Leu Asn Val Tyr Phe Phe Arg Leu Val Asp Pro Arg Arg Arg Asn 
290 295 300 

Val Ala Cys Ile Glu His Ala Leu Glu Lys Leu Tyr Tyr Ile Lys Glu 
305 310 315 320 

Cys Cys Tyr Asp Pro Val Arg Trp Leu Thr Glu Gln Tyr Asp Gly Tyr 
325 330 335 

Leu Lys Gly Arg Gln Pro Pro Lys Ser Pro Ser Ile Thr Leu Asp Asp 
340 345 350 

Gly Leu Val Tyr Val Arg Arg Val Leu Val Thr Pro Cys Lys Val Tyr 
355 360 365 

Phe Cys Gly Pro Glu Val Asn Val Ser Asn Arg Val Leu Arg Asn Tyr 
370 375 380 

Ser Glu Asp Ile Asp Asn Phe Leu Arg Val Ser Phe Val Asp Glu Glu 
385 390 395 400 

Trp Glu Lys Leu Tyr Ser Thr Asp Leu Leu Pro Lys Ala Ser Thr Gly 
405 410 415 

Ser Gly Val Arg Thr Asn Ile Tyr Glu Arg Ile Leu Ser Thr Leu Arg 
420 425 430 

Lys Gly Phe Val Ile Gly Asp Lys Lys Phe Glu Phe Leu Ala Phe Ser 
435 440 445 

Ser Ser Gln Leu Arg Asp Asn Ser Val Trp Met Phe Ala Ser Arg Pro 
450 455 460 

Gly Leu Thr Ala Asn Asp Ile Arg Ala Trp Met Gly Asp Phe Ser Gln 
465 470 475 480 

Ile Lys Asn Val Ala Lys Tyr Ala Ala Arg Leu Gly Gln Ser Phe Gly 
485 490 495 

Ser Ser Arg Glu Thr Leu Ser Val Leu Arg His Glu Ile Glu Val Ile 
500 505 510 

Pro Asp Val Lys Val His Gly Thr Ser Tyr Val Phe Ser Asp Gly Ile 
515 520 525 

Gly Lys Ile Ser Gly Asp Phe Ala His Arg Val Ala Ser Lys Cys Gly 
530 535 540 

Leu Gln Tyr Thr Pro Ser Ala Phe Gln Ile Arg Tyr Gly Gly Tyr Lys 
545 550 555 560 

Gly Val Val Gly Val Asp Pro Asp Ser Ser Met Lys Leu Ser Leu Arg 
565 570 575 

Lys Ser Met Ser Lys Tyr Glu Ser Asp Asn Ile Lys Leu Asp Val Leu 
580 585 590 

Gly Trp Ser Lys Tyr Gln Pro Cys Tyr Leu Asn Arg Gln Leu Ile Thr 
595 600 605 






Leu Leu Ser Thr Leu Gly Val Lys Asp Glu Val Leu Glu Gln Lys Gln 
610 615 620 

Lys Glu Ala Val Asp Gln Leu Asp Ala Ile Leu His Asp Ser Leu Lys 
625 630 635 640 

Ala Gln Glu Ala Leu Glu Leu Met Ser Pro Gly Glu Asn Thr Asn Ile 
645 650 655 

Leu Lys Ala Met Leu Asn Cys Gly Tyr Lys Pro Asp Ala Glu Pro Phe 
660 665 670 

Leu Ser Met Met Leu Gln Thr Phe Arg Ala Ser Lys Leu Leu Asp Leu 
675 680 685 

Arg Thr Arg Ser Arg Ile Phe Ile Pro Asn Gly Arg Thr Met Met Gly 
690 695 700 

Cys Leu Asp Glu Ser Arg Thr Leu Glu Tyr Gly Gln Val Phe Val Gln 
705 710 715 720 

Phe Thr Gly Ala Gly His Gly Glu Phe Ser Asp Asp Leu His Pro Phe 
725 730 735 

Asn Asn Ser Arg Ser Thr Asn Ser Asn Phe Ile Leu Lys Gly Asn Val 
740 745 750 

Val Val Ala Lys Asn Pro Cys Leu His Pro Gly Asp Ile Arg Val Leu 
755 760 765 

Lys Ala Val Asn Val Arg Ala Leu His His Met Val Asp Cys Val Val 
770 775 780 

Phe Pro Gln Lys Gly Lys Arg Pro His Pro Asn Glu Cys Ser Gly Ser 
785 790 795 800 

Asp Leu Asp Gly Asp Ile Tyr Phe Val Cys Trp Asp Gln Asp Met Ile 
805 810 815 

Pro Pro Arg Gln Val Gln Pro Met Glu Tyr Pro Pro Ala Pro Ser Ile 
820 825 830 

Gln Leu Asp His Asp Val Thr Ile Glu Glu Val Glu Glu Tyr Phe Thr 
835 840 845 

Asn Tyr Ile Val Asn Asp Ser Leu Gly Ile Ile Ala Asn Ala His Val 
850 855 860 

Val Phe Ala Asp Arg Glu Pro Asp Met Ala Met Ser Asp Pro Cys Lys 
865 870 875 880 

Lys Leu Ala Glu Leu Phe Ser Ile Ala Val Asp Phe Pro Lys Thr Gly 
885 890 895 

Val Pro Ala Glu Ile Pro Ser Gln Leu Arg Pro Lys Glu Tyr Pro Asp 
900 905 910 

Phe Met Asp Lys Pro Asp Lys Thr Ser Tyr Ile Ser Glu Arg Val Ile 
915 920 925 







Gly Lys Leu Phe Arg Lys Val Lys Asp Lys Ala Pro Gln Ala Ser Ser 
930 935 940 

Ile Ala Thr Phe Thr Arg Asp Val Ala Arg Arg Ser Tyr Asp Ala Asp 
945 950 955 960 

Met Glu Val Asp Gly Phe Glu Asp Tyr Ile Asp Glu Ala Phe Asp Tyr 
965 970 975 

Lys Thr Glu Tyr Asp Asn Lys Leu Gly Asn Leu Met Asp Tyr Tyr Gly 
980 985 990 

Ile Lys Thr Glu Ala Glu Ile Leu Ser Gly Gly Ile Met Lys Ala Ser 
995 1000 1005 

Lys Thr Phe Asp Arg Arg Lys Asp Ala Glu Ala Ile Ser Val Ala Val 
1010 1015 1020 

Arg Ala Leu Arg Lys Glu Ala Arg Ala Trp Phe Lys Arg Arg Asn Asp 
1025 1030 1035 1040 

Ile Asp Asp Met Leu Pro Lys Ala Ser Ala Trp Tyr His Val Thr Tyr 
1045 1050 1055 

His Pro Thr Tyr Trp Gly Cys Tyr Asn Gln Gly Leu Lys Arg Ala His 
1060 1065 1070 

Phe Ile Ser Phe Pro Trp Cys Val Tyr Asp Gln Leu Ile Gln Ile Lys 
1075 1080 1085 

Lys Asp Lys Ala Arg Asn Arg Pro Val Leu Asn Leu Ser Ser Leu Arg 
1090 1095 1100 

Ala Gln Leu Ser His Arg Leu Val Leu Lys 
1105 1110 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Arg Thr Met Met Gly Cys Leu Asp Glu Ser Arg Thr Leu Glu Tyr Gly 
1 5 10 15 

Gln Val Phe Val Gln Phe Thr Gly Ala Gly His Gly Glu Phe Ser Asp 
20 25 30 

Asp Leu His Pro Phe Asn Asn Ser Arg Ser Thr Asn Ser Asn Phe Ile 
35 40 45 

Leu Lys Gly Asn Val Val Val Ala Lys Asn Pro Cys Leu His Pro Gly 
50 55 60 

Asp Ile Arg Val Leu Lys Ala Val Asn Val Arg Ala Leu His His Met 
65 70 75 80 

Val Asp Cys Val Val Phe Pro Gln Lys Gly Lys Arg Pro His Pro Asn 
85 90 95 

Glu Cys Ser Gly Ser Asp Leu Asp Gly Asp Ile Tyr Phe Val Cys Trp 
100 105 110 

Asp Gln Asp Met Ile Pro Pro Arg Gln Val Gln Pro Met Glu Tyr Pro 
115 120 125 

Pro Ala Pro Ser Ile Gln Leu Asp His Asp Val Thr Ile Glu Glu Val 
130 135 140 

Glu Glu Tyr Phe Thr Asn Tyr Ile Val Asn Asp Ser Leu Gly Ile Ile 
145 150 155 160 

Ala Asn Ala His Val Val Phe Ala Asp Arg Glu Pro Asp Met Ala Met 
165 170 175 

Ser Asp Pro Cys Lys Lys Leu Ala Glu Leu Phe Ser Ile Ala Val Asp 
180 185 190 

Phe Pro Lys Thr Gly Val Pro Ala Glu Ile Pro Ser Gln Leu Arg Pro 
195 200 205 

Lys Glu Tyr Pro Asp Phe Met Asp Lys Pro 
210 215 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CATAACGAAT CTGGAAAGCA GATGG 
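The listing describes SEQ ID NO: 4 only as a "primer" and does not state which strand it anneals to. As a hedged illustration, the sketch below computes the reverse complement of the primer and checks it against a short stretch of SEQ ID NO: 1 copied from the rows ending at positions 1861 and 1909. The helper name `reverse_complement` and the choice of fragment are assumptions made for this example, not part of the listing.

```python
# Translation table for the four DNA bases (no ambiguity codes handled).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string, ignoring spaces."""
    cleaned = seq.replace(" ", "").upper()
    return cleaned.translate(COMPLEMENT)[::-1]


# Primer text as printed for SEQ ID NO: 4.
primer = "CATAACGAAT CTGGAAAGCA GATGG"

# Illustrative fragment of SEQ ID NO: 1 (end of the row numbered 1861
# followed by the start of the row numbered 1909), chosen for this sketch.
fragment = "CCA TCT GCT TTC CAG ATT CGT TAT GGT GGA".replace(" ", "")

target = reverse_complement(primer)
primer_is_antisense = target in fragment
print(target, primer_is_antisense)
```

On the sequences as printed, the reverse complement of the primer occurs in that fragment, which suggests (but does not by itself establish) that SEQ ID NO: 4 corresponds to the antisense strand of the cDNA in this region.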
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GATGAATCCG GATCAACACC CACAC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGTGCTGGA GGATATTCCA TCGGC 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CTTCACCAGG GATCCACTCA TCACTCCCCT CAAG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCATAACTTC AGGGGGGATC CAGTTGGTGT TAGC 
(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GCAGCTTCAT GCAGATCTAA AGACAAAAGG TAGTC 35 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Asn Arg Val Leu Arg Asn Tyr Ser Glu Asp Ile Asp Asn 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ala Ser Lys Thr Phe Asp Arg Arg Lys Asp Ala Glu Ala Ile 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Glu Gln Tyr Asp Gly Tyr Leu Lys Gly Arg Gln Pro Pro Lys Ser Pro 
1 5 10 15 

Ser 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Val Phe Pro Gln Lys Gly Lys Arg Pro His Asn Glu Cys 
1 5 10 






